PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Csa05g044470.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HD-ZIP
Protein Properties Length: 733aa    MW: 81180.4 Da    PI: 7.6475
Description HD-ZIP family protein
Gene Model
Gene Model ID   Type    Source  Coding Sequence
Csa05g044470.1  genome  CSGP    View CDS
Signature Domain
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  57.8   1.8e-18  65     120  1          56
                     TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     ++kR +++++q++e+e++F+++++p+ ++r+ L ++lgL+  q+k+WFqN+R++ k
  Csa05g044470.1  65 KKKRNRHSQYQIQEMEAFFRECPHPDDKQRTVLGRQLGLKPIQIKFWFQNKRTQNK 120
                     79***************************************************998 PP

2    START     158.8  4.1e-50  247    467  1          206
                     HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEE CS
           START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetl 82 
                     ela e ++el+++a+++ep+W   +      +n  e+ ++f  + +     +++ea+r++++v+m+++ +ve l++ +  W++++     +a t+
  Csa05g044470.1 247 ELAIESMEELLLMAQVDEPLWIGGVngtsLALNWNEYERTFRMGLGprlsgFRIEASRETALVPMNPTGVVEMLMQAN-LWSTMFVgmvgRAVTH 340
                     57899*****************999887756777777777755555888899**************************.**************** PP

                     EEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-E CS
           START  83 evissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehv 170
                     e++ ++      galq m+ae+q lsplvp R+++fvRy++q ++  w++vdvS+d+   +       +++++pSg+li++ +ng+skvtwvehv
  Csa05g044470.1 341 EKLLPDvtgnfnGALQIMSAEYQDLSPLVPtRESYFVRYCKQ-RDNLWAVVDVSIDHFVTNIH----TKCRRRPSGCLIQEIPNGYSKVTWVEHV 430
                     ******************************************.**************998874....78889*********************** PP

                     E--SSXX.HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
           START 171 dlkgrlp.hwllrslvksglaegaktwvatlqrqcek 206
                     ++++r   +++++++v++g+a++a++wvatl+r ce+
  Csa05g044470.1 431 EVDDREAyQNIFKHFVSTGQAFAANRWVATLERRCER 467
                     *****999***************************96 PP

Protein Features
3D Structure
Database         Entry ID           E-value    Start  End  InterPro ID  Description
SuperFamily      SSF46689           2.95E-18   52     123  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60   2.7E-19    60     125  IPR009057    Homeodomain-like
PROSITE profile  PS50071            16.714     62     122  IPR001356    Homeobox domain
SMART            SM00389            4.1E-17    64     126  IPR001356    Homeobox domain
Pfam             PF00046            4.6E-16    65     120  IPR001356    Homeobox domain
CDD              cd00086            8.88E-17   65     123  No hit       No description
PROSITE profile  PS50848            38.252     238    470  IPR002913    START domain
SuperFamily      SSF55961           6.6E-30    239    469  No hit       No description
CDD              cd08875            3.34E-105  242    466  No hit       No description
Pfam             PF01852            2.5E-42    247    467  IPR002913    START domain
SMART            SM00234            2.0E-45    247    467  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  5.1E-5     302    432  IPR023393    START-like domain
SuperFamily      SSF55961           6.0E-11    525    689  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 733 aa
MSESNMVPVG NNGDTDNKEN NVMNNTDGGA CVGAGSGAEE IDSAKTVSDN REEAMGSSEG  60
PPRKKKKRNR HSQYQIQEME AFFRECPHPD DKQRTVLGRQ LGLKPIQIKF WFQNKRTQNK  120
NHQEHCENIG LRSLNKKLSS ENQRFREAVF LALCPKCGGQ TAIGEMCSEE HQLRIQNARL  180
TDEINELTAL AESFTRKMVM SPRPFHRPPT FEFRAGSSTG RELYGNGGNL SRGITGPADA  240
DKPMIKELAI ESMEELLLMA QVDEPLWIGG VNGTSLALNW NEYERTFRMG LGPRLSGFRI  300
EASRETALVP MNPTGVVEML MQANLWSTMF VGMVGRAVTH EKLLPDVTGN FNGALQIMSA  360
EYQDLSPLVP TRESYFVRYC KQRDNLWAVV DVSIDHFVTN IHTKCRRRPS GCLIQEIPNG  420
YSKVTWVEHV EVDDREAYQN IFKHFVSTGQ AFAANRWVAT LERRCERIAG NNIMTTEFQS  480
VDSADRLALT DHGKMSILKL AERVARSFFI GLTNSMGTPI SGVGGKGIRV MKMKNLNGPG  540
RPPGVVISAS TSFWVPCPPT IVFDFLRNEN HRANWDALCN GWIVQKISEI ANGRDSRNCA  600
TLLKLQNKST CQMNKMIIQE TFTDPTASFV IYAPVDTASI EGLLNLGTDP DYVALLPSGF  660
AILPDGMGDQ PGGNGGGSLL TVSLQMLVEE VSLAKLSISS ARSVENLIRA TVMRIKDLFP  720
FNIATPSTRN VS*
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    62     67   RKKKKR
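
The coordinates in this record can be checked against the displayed sequence. The following is a minimal sketch, not part of the database output: the helper names `clean_sequence` and `slice_1based` are hypothetical, and only the first two lines of the sequence block are embedded. It shows that the Homeobox domain coordinates (65 to 120) select the aligned segment when read as 1-based and inclusive. The NLS motif RKKKKR is found at 0-based offset 62, which agrees with the listed Start of 62 only if the NLS table is read as 0-based offsets; that reading is an assumption inferred from this one record.

```python
import re

# First two lines of the protein sequence block, copied as displayed
# (groups of ten residues followed by a running position counter).
RAW = """\
MSESNMVPVG NNGDTDNKEN NVMNNTDGGA CVGAGSGAEE IDSAKTVSDN REEAMGSSEG  60
PPRKKKKRNR HSQYQIQEME AFFRECPHPD DKQRTVLGRQ LGLKPIQIKF WFQNKRTQNK  120
"""


def clean_sequence(raw: str) -> str:
    """Keep letters only: drops spaces, position counters and a trailing '*'."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both bounds as 1-based, inclusive."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)

# Homeobox domain, Start 65 / End 120 from the Signature Domain table.
homeobox = slice_1based(seq, 65, 120)

# Locate the NLS motif; str.find returns a 0-based offset (-1 if absent).
nls_offset = seq.find("RKKKKR")

print(len(seq))      # number of residues parsed from the two lines
print(homeobox)      # segment to compare with the Homeobox alignment row
print(nls_offset)    # 0-based start of the NLS motif
```

Pasting the full sequence block into `RAW` allows the same check for the START domain (247 to 467).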
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_010510869.1  0.0      PREDICTED: homeobox-leucine zipper protein HDG3-like isoform X1
Swissprot  Q9ZV65          0.0      HDG3_ARATH; Homeobox-leucine zipper protein HDG3
TrEMBL     R0HIB7          0.0      R0HIB7_9BRAS; Uncharacterized protein
STRING     AT2G32370.1     0.0      (Arabidopsis thaliana)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM49128149
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G32370.1  0.0      homeodomain GLABROUS 3